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Abstract 



Consider a sequence of bit strings of length d, such that each string differs from the 
next in a constant number of bits. We call this sequence a quasi-Gray code. We 
examine the problem of efficiently generating such codes, by considering the number 
of bits read and written at each generating step, the average number of bits read 
while generating the entire code, and the number of strings generated in the code. 
Our results give a trade-off between these constraints, and present algorithms that 
do less work on average than previous results, and that increase the number of bit 
strings generated. 
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Chapter 1 
Introduction 



1.1 Problem Statement 

We are interested in efficiently generating a sequence of bit strings. The class of bit 
strings we wish to generate are cyclic quasi-Gray codes. A Gray code |Gra53j is a 
sequence of bit strings, such that any two consecutive strings differ in exactly one bit. 
We use the term quasi-Gray code |Fre78] to refer to a sequence of bit strings where 
any two consecutive strings differ in at most c bits, where c is a constant defined for 
the code. A Gray code (quasi-Gray code) is called cyclic if the ffist and last generated 
bit strings also differ in at most 1 bit (c bits). 

We say a bit string that contains d bits has dimension d, and are interested in 
efficient algorithms to generate a sequence of bit strings that form a quasi-Gray code 
of dimension d. After generating a bit string, we say the algorithm's data structure 
corresponds exactly to the generated bit string, and it's state is the bit string itself. 
In this way, we restrict an algorithm's data structure to using exactly d bits. At each 
step, the input to the algorithm will be a bit string, which is the algorithm's current 
state. The output will be a new bit string that corresponds to the next state of the 
algorithm's data structure. 

The number of consecutive unique bit strings generated is equal to the number of 
consecutive unique states for the generating data structure, and we call this value L, 
the length of the generated code. Clearly L < 2'^. We define the space efficiency of 
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an algorithm as the ratio that is, the fraction of bit strings generated out of 

all possible bit strings given the dimension of the strings. When the space efficiency 
is 1, we call the data structure space- optimal, as it generates all possible bit strings. 
When L < 2'^, the structure is non-space-optimal; as we will see, this allows the time 
required to generate each consecutive bit string to be improved. 

Each generating step takes as input the output of the previous generating step, 
which is a bit string in the quasi-Gray code. The average number of bits read is 
defined to be the ratio of the total number of bits read, to the length of the code, 
when generating one iteration of the entire quasi-Gray code. 

Our goal is to study and improve efficiency of algorithms for generating quasi-Gray 
codes in the following ways. 

1. Worst-Case read: We would like to know how many bits the algorithm must 
read in the worst case in order to make the appropriate changes in the input 
string and generate the next bit string in the code, and find ways to reduce this 
when possible. 

2. Worst-case write: We would like to know how many bits must change in the 
worst case to reach the successor string in the code, and keep this to 1 when 
possible while improving upon other metrics. 

3. Average number of bits read: We would like to reduce the average number of 
bits read at each generating step while maintaining other metrics. 

4. Space efficiency: We would like our algorithms to be as space efficient as pos- 
sible, ideally generating as many bit strings as their dimension allows, with 
L = 2^. 

Our results give a trade-off between these different goals. 

Our decision to limit the algorithm's data structure to exactly d bits differs from 
previous work, where the data structure could use more bits than the strings it gen- 
erated |Fre78l IRMOSj . To compare previous results to our own, we consider the extra 
bits in their data structure to be a part of their generated bit strings. This gives a 
more precise view of the space efficiency of an algorithm. 
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Each generated bit string of dimension d has a distinct totally ordered rank in the 
generated code with respect to the initial bit string in the code. For a cyclic code, 
the initial bit string can be chosen arbitrarily. We assume the initial bit string to be 
the bit string of d zeros unless stated otherwise. Given a string of rank A; in a code 
of length L, where < A; < L, we want to support the following operations: 

• next generates the bit string of rank {k + 1) mod L 

• previous generates the bit string of rank {k — 1) mod L 

We work within the bit probe model |MP69l IRMOSj . where the performance of 
an algorithm is measured by counting the average-case and the worst-case number 
of bits read and written. We examine these values for the process of generating 
each bit string in a quasi-Gray code. We use the Decision Assignment Tree (DAT) 
model |Fre78j (which we describe further in Chapter [3]) to construct algorithms for 
generating quasi-Gray codes and describe the algorithms' behaviour, as well as to 
discuss upper and lower bounds. 

1.2 Definitions 

We use a notation for the iterated logarithm function of the form log*^'^-' n where c is 
a non-negative integer, and is always surrounded by brackets to differentiate it from 
an exponent. The value of the function is defined as follows. When c = 0, log*-'^-' n = 
n. If c > 0, then log'''^^(n) = log'''^~^^(log(n)). For example, log*-^^ n = log(logn). 
Throughout, the base of the log function is assumed to be 2 unless stated otherwise. 

We define the function log* n to be equal to the smallest non-negative value of c 
such that log'''^^ n < 1. For example log* 1 = and log* 3 = 2. 



1.3 Results Summary 

Our results, as well as previous results, are summarized in Table 1.1 
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First, we present some space-optimal algorithms. Although our space-optimal 
algorithms read a small number of bits in the average case, they all read d bits in the 
worst case. 



In Section 4.1, we describe the Recursive Partition Gray Code (RPGC) algorithm, 
which generates a Gray code of dimension d while reading on average no more than 
6 log d bits. This improves the average number of bits read for a space-optimal Gray 



code from d to 0{\ogd). In Section 4.2, we use the RPGC to construct a DAT 
that generates a quasi-Gray code while reducing the average number of bits read. 



We then apply this technique iteratively in Section 4.3 to create, for any constant 
c > 1, a dimensional DAT that reads worst-case d bits, but reads on average only 
Glog'-^'^"^^ d + 11 bits, and writes at most c bits. This lowers the average number of 
bits read to generate a space-optimal quasi-Gray code from 0{logd) to 0(log''^'^^^-' d), 
when c > 2. 



In Section 4.4 we create a dimensional DAT that reads worst-case all d bits, 
while reading at most 17 bits on average, and writing at most [(log* d + 5)/2j bits 
to generate each bit string, while also being space-optimal. This reduces the average 
number of bits read to 0(1) for a space-optimal code, but increases the number of 
bits written to be slightly more than a constant. 

Next, we consider quasi-Gray codes that are not space-optimal, but achieve space 
efficiency arbitrarily close to 1, and that read 0{logd) bits in the worst case. 



In Section 4.5 we construct a DAT of dimension d = n + (t + l) \ogn that reads and 
writes {t + 1) logn-l-1 bits in the worst case, and has space efficiency l — 0{n~^). This 
improves the space efficiency dramatically of previous results, when the worst-case 
number of bits written is O(logn), from 1/2 to 1 in the limit ri — oo. By combining 
a simple Gray code with this result, we are able to produce a DAT of dimension 
d = n + {t + l)logn that reads 0{tlogn) bits on average and in the worst case, 
but writes at most 3 bits. This reduces the worst-case number of bits written from 
0(log?T,) to 0(1), while the space efficiency remains asymptotically the same. 



We then combine results from Section 4.3 to produce a DAT of dimension d = 
n + {t + 1) logn that reads in the worst case {t + 1) logn + 1 bits, reads on average 
121og^^'^^ n + 0(1) bits, and writes at most 2c + 1 bits, for any constant c > 1. This 
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improves the average number of bits read to generate a quasi-Gray code with 0(log d) 
bits read in the worst case. We reduce the average number of bits read from 0{\ogd) 
to O (log log (i) when writing the same number of bits, and when writing a constant 
number more, the average becomes 0(\og^'^^^ d), while keeping the space efficiency 
arbitrarily close to 1. Lastly, we show this DAT can also generate a code while 
reading a constant number of bits on average, if it writes log *n — 3 bits in the worst 
case. 

A summary of the results in this thesis is appearing at SWAT 2010, the 12th 
Scandinavian Symposium and Workshops on Algorithm Theory. 

1.4 Organization of the thesis 

The remainder of this work will be organized as follows. In Chapter |2| we review 
related previous work, and discuss its relationship to our own work. In Chapter [3} 
we present the Decision Assignment Tree model and make some observations about 
generating quasi-Gray codes within the model. In Chapter |4| we present our results, 
starting with the Recursive Partition Gray Code, followed by the RPGC-Composite 
quasi-Gray Code, and finally our Lazy Counters which build up to our Winelncrement 
counter. We conclude in Chapter |5| with a summary of our work and discussion of 
related open problems and future work. 



Chapter 2 



Previous work 



2.1 Gray codes 

The Gray code was invented by Frank Gray in 1953 |Gra53] . In his patent apphcation, 
Gray described what we now know as the Binary Reflected Gray Code (BRGC). This 
code is a sequence of bit strings of dimension d, where each successive bit string differs 
from the previous one in exactly one bit and where all the possible bit strings are 
present. From this code came the more general term, Gray code, which refers to any 
sequence of bit strings where successive strings differ in exactly one bit. Furthermore, 
the concept of a cyclic Gray code was used to describe a Gray code where the first 
and last bit strings differ in exactly one bit, such as the original BRGC, creating a 
secondary class of non-cyclic Gray codes. 

The BRGC has a structure which can be defined recursively. For a single bit, the 
code is simply followed by 1. To create a code of ci + 1 bits, given the code of d 
bits: first place the 2'^ bit strings of the dimension d in order. Concatenate a onto 
the left end of each of the bit strings. Then repeat the same 2°^ bit strings in reverse 



order, concatenating a 1 onto the left end of each one. Figure shows the BRGC 
for up to three bits. Note that the last bit string in the code differs from the first in 
a single bit, making the code a cyclic Gray code. 
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Figure 2.1: The standard Binary Reflected Gray Code for 1, 2, and 3 dimensions 



A furtlier generalization of Gray codes was provided by Fredman |Fre78j when he 
coined the term quasi- Gray code. In Fredman's work, a quasi-Gray code was defined 
to be a (i-dimensional bit string along with some unbounded additional data structure. 
The quasi-Gray code differs in each successive state in one bit of the d-dimensional 
bit string, but the algorithm may also make arbitrary changes to its additional data 
structure. In order to efficiently generate the successor bit string in a quasi-Gray 
code, the algorithm may make use of the additional data structure, reading fewer bits 
than it would otherwise need to. Fredman characterizes the efficiency of a generating 
algorithm by the worst-case number of bits read and written to generate each bit 
string in the code. Under this model, the algorithm will write a single bit in the 
worst case if and only if the algorithm does not use any additional data structure. In 
this case, Fredman notes that it would be generating a Gray code. 

We modify this definition slightly to improve the clarity of analysis. Because the 
additional data structure and the code itself are equally part of the algorithm's state, 
we consider Fredman's additional data structure to be a part of the generated code. 
We use the term quasi-Gray code to refer to a sequence of d-dimensional bit strings 
where successive bit strings each differ by at most a constant number of bits. This 
is similar to Fredman's model, but with a stronger limitation on the number of bits 



2.2 Quasi-Gray codes 
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being written, as his model allowed an arbitrary number of bits to be written while 
still considering it a quasi-Gray code. Thus an algorithm that, under Fredman's 
definition, generated a ci-dimensional quasi-Gray code using an additional k bits of 
data structure and writing at most a constant number of bits would, in our redefinition 
of the term, simply generate a (d + /c)-dimensional quasi-Gray code. 

Fredman's results generate codes while writing in the worst case 0(2'^) bits, which 
do not qualify as quasi-Gray codes under our definition of the term, and which have 
space efficiency 0(1/2^°'). 

Rahman and Munro |RM08j begin to address space efficiency while generating 
quasi-Gray codes, without naming it explicitly, which we continue in this thesis. 
However, we expand upon their work by also examining the average number of bits 
read by generating algorithms. Rahman and Munro construct quasi-Gray codes in the 
manner of Fredman, where they generate a c?- dimensional Gray code, while keeping 
some additional data structure. However, the authors use much smaller data struc- 
tures, eventually constructing an algorithm that requires only three extra bits. This 
brings the space efficiency of their algorithm up to 1/8, which was the first algorithm 
to read less than d bits in the worst case with space efficiency that did not become 
arbitrarily close to for large d. This algorithm generates a (i-dimensional quasi-Gray 
code while reading log{d — 3) + 6 bits in the worst case and writing at most 7 bits. 

Rahman and Munro do not use the DAT model, and they sometimes include in 
their analysis the amount of work to decode rankings inside the quasi-Gray code's 
structure. Under the DAT model, the rank can be embedded into the tree, so we 
don't need to examine such costs. For this reason we compare our work to theirs only 
in the case where they also ignore these costs. Rahman and Munro give one such 
algorithm, which ignores the work involved in determining ranks. The bounds given 
for this algorithm also hold in the DAT model. The algorithm uses only a single bit 
of extra data structure, giving it a space efficiency of 1/2. This algorithm generates 
a d-dimensional quasi-Gray code while reading log{d — 1) -|- 4 bits in the worst case 
and writing at most 4 bits. 

Rahman and Munro also consider the problem of adding or subtracting two num- 
bers, when each is stored using a quasi-Gray code representation. They give a data 
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structure that uses d = n + O(log^n) bits. Incrementing or decrementing a number 
on its own, equivalent to generating the next or previous bit string in the quasi-Gray 
code, requires at most O(logn) bits to be read and at most 5 bits to be changed. This 
data structure also supports adding or subtracting two such numbers of dimension d 
and d', where d > d' , while reading at most 0{d + logrf') bits. 

Savage highlights works around Gray codes in her survey [Sav97j . which includes 
both Gray codes as sequences of bit strings, and more general combinatorial sequences 
with minimal change between successive states, referred to as combinatorial Gray 
codes. The majority of the work related to quasi-Gray codes for bit strings discusses 
mathematical aspects of Gray codes such as existence of various classes of Gray codes, 
rather than algorithms for efficient generation of these sequences. 

Frank Ruskey devotes a chapter of his book-in-progress jRusOl ] to algorithms 
for generating combinatorial Gray codes. The majority of this work is devoted to 
generating other forms of combinatorial Gray codes than bit strings as we consider 
in this thesis, but he does include a section on generating the BRGC. In this section, 
Ruskey describes an algorithm to generate the (i- dimensional BRGC while reading 
and writing 0(1) bits in the worst case, by making use of an additional d bits. This 
gives a quasi-Gray code with space efficiency of 1/2'^. 

Knuth, in volume 4 of The Art of Computer Programming |Knu05j , discusses the 
problem of generating Gray codes. He gives an overview of various applications for 
Gray codes, and surveys some known results, both in generating them, and in analysis 
of other properties. Knuth shows an algorithm for loopless generation of a Gray code 
of dimension d, |BER76] where each generating step can be executed without any loop 
in the algorithm, that has space efficiency 2"^^. He also discusses other properties of 
Gray codes, such as a balanced number of bit flips for each bit position, having each 
bit keep its value for at least a constant number of states, or monotonicity, where 
each bit string of rank x in the code has at most as many bits set to 1 as the bit 
string of rank x + 2. 

Frandsen et al. |SFMS97] describe a method of generating a sequence of bit 
strings of dimension d = n + \ogn while reading and writing at most Oilogn) bits 
for each generating step. Their counter algorithm has space efficiency 0(l/n), which 
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converges to in the limit n — )■ oo. An observation by Brodal |Bro09j improves the 
space efficiency of their counter algorithm to 1/2, matching the efficiency of the work 



by Rahman and Munro. We improve on these counters in Section |4.5| in terms of 
average bits read and space efficiency. 



2.3 Upper and lower bounds 

We use the Decision Assignment Tree (DAT) model to analyze algorithms for gener- 
ating bit strings in a quasi-Gray code. The model was ffist introduced for this context 
by Fredman |Fre78] . We will describe the DAT model and how we use it in detail in 
Chapter |3| while briefly describing it here. 

An algorithm to generate a quasi-Gray code takes a bit string of dimension d as 
input, and modifies it to become the next bit string in the quasi Gray-code. This 
operation does not necessarily require reading all d bits of the current string. The 
DAT model can be used for proving both upper and lower bounds on the required 
number of bits to be read in order to generate a quasi-Gray code. In this work, we 
construct and analyze our algorithms under this model to provide upper bounds. 

Meanwhile, a non-trivial lower bound for the worst case number of bits read while 
generating a Gray code remains unknown. A trivial DAT, such as iterating through 
the standard binary representations of to 2*^ — 1, in the worst case, will require 
reading and writing all d bits to generate the next bit string, but it may also read 
and write as few as one bit when the least-significant bit changes. On average, it reads 
and writes 2 — 2^~'^ bits. Meanwhile, it is possible to create a DAT that generates the 



Binary Reflected Gray Code, as described in Section 3.3 This DAT would always 
write exactly one bit, but requires reading all d bits to generate each successive bit 
string in the code. This is because the least-significant bit is flipped if and only if the 
parity is even, which can only be determined by reading all d bits. 

To generate a Gray code of dimension d with length L = 2'^, Fredman |Fre78j uses 
the DAT model to show that any algorithm will require reading fl[logd) bits for some 
bit string. Fredman conjectures that for a Gray code of dimension d with L = 2'^, 
any DAT will have to read all d bits to generate at least one bit string in the code. 
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must have height d. This remains an open 



"'^In |RM08j the authors claim to have proven this conjecture true for "smah" d by exhaustive 
search. 



Chapter 3 



Decision Assignment Trees 



3.1 The Decision Assignment Tree Model 

In the DAT model, an algorithm is described as a binary tree. We say that a DAT 
which reads and generates bit strings of length d has dimension d. Further, we refer 
to the bit string that the DAT reads and modifies as the state of the DAT. Generally 
the initial bit string for a quasi-Gray code of dimension d, and thus the initial state of 
its generating DAT, is the bit string made up of a sequence of d zeros. Each internal 
node of the tree is labeled with a single fixed position < i < (i — 1 within the input 



bit string, and represents reading that bit i. Figure ?? shows a DAT that generates 



the BRGC of dimension d = 3. The BRGC that is generated is also seen in Figure 



Let T be a DAT of dimension d. The algorithm starts at the root of T, and reads 
the bit with which that node is labeled. Then it moves to a left or right child of that 
node, depending on whether the bit read was a or a 1, respectively. This repeats 
recursively until a leaf node in the tree is reached. 

Each leaf node of T represents a subset of states where the bits read along the 
path to the leaf are in a fixed state. More formally, a state Si for a DAT is represented 
by a single leaf Lj. When the DAT is in state 5*^, traversing the DAT while reading 
the current state will lead to the leaf L,. It is possible for two different states Si and 
Sj to share the same leaf if they both cause the same bit positions to be read and 
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Figure 3.1: A DAT that generates the Binary Reflected Gray Code on three bits. 
The bits are labelled from as the right-most to 2 as the left-most bit. The label in 
each internal node specifies which bit is being read. Control moves to the left child 
or right child if the bit is equal to or 1, respectively. The rules in the leaf nodes set 
a bit i to a new value, or 1. 

those positions share all the same values. In this case it is required at least one of the 
bits that wore not read on the path to the leaf Ljn must be in a different state each 
time the DAT traversal reaches L^. 

The leaf nodes each contain rules that describe which bits to update to generate 
the next bit string in the code. The update rules are constrained in the following 
ways: 

1. Each rule must set a single fixed bit directly to or to 1. 

2. The rules together must change at least one bit. 

Analysis under this model can be done by examining the structure of the tree. 
The worst-case number of bits read will be equal to the height of the tree, and the 
worst-case number of bits written will be equal to the maximum number of rules 
in any leaf of the tree. The average number of bits read and written are not easily 
derived from the tree's structure. The average number of bits read will be equal to 
the sum of all paths in the tree, weighted by the fraction of times the path is used 
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when generating the entire quasi-Gray code. The average number of bits written will 
be a similar weighted average, for the number of rules in the leaves. 

3.2 Assembling DATs 

Decision Assignment Trees can be assembled by joining together other DATs. We 
present here observations based on this. 

Lemma 3.1. Let L and R each be a DAT for a binary code of dimension d with space 
efficiency 1 . The L and R trees may he joined together, under a new root node, to 
create a DAT of dimension d-\-l with space efficiency 1. 

Proof. We join the L tree and R tree together by adding a new root node, and making 
L and R its left and right subtrees respectively. The subtrees L and R each read and 
write d bits. We assign them to the same bits, to c? — 1, and the root node to the 
d-th bit. We assume w.l.o.g. that reading a at the root node means to move to the 
root of the L subtree, while 1 means to move to the root of the R subtree. Assume 
that the d-th bit is initially set to 0. If it is 1, then swap L and R in what follows. 

It is clear that L remains a valid Decision Assignment Tree of dimension d. And 
because it never changes the d-th bit, it will never cause the R subtree to be used. 
Thus our initial construction is a valid Decision Assignment Tree of dimension o? + 1 
that counts through only 2°* states. 

Let La be the first state of L and Lz be the last state. If L is cyclic, then any 
two states such that La immediately follows Lz in the code are valid. Similarly, let 
Ra and Rz be the first and last states of R. We modify the construction to join the 
code generated by L to the code generated by R, producing a new code of dimension 
d+1: 

1. Make the update rules of Lz change the counter to state Ra, and change the 
d-th bit to 1. 

2. Make the update rules of Rz change the counter to state La, and change the 
d-th bit to 0. 
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Note that the leaf for state Lz may be shared by another state L^, and is therefore 
invoked multiple times by the L subtree. If this is the case, simply split the leaf node, 
giving it two children that differentiate on a bit that is different in Lz and L^. Repeat 
this process until the leaf for Lz is not used for any other states in L. 

Because the L subtree is able to count through 2"^ states, it will take 2"^ — 1 steps 
to go from La to Lz, and likewise for the R subtree. Within each of these steps, the 
d-th bit is not changed and the subtree is able to operate correctly. After generating 
2'^ — 1 bit strings in L, the d-th bit is changed, and the state is changed to Ra- This 
makes 2'^ consecutive bit strings, generated by the L subtree. Now the same argument 
holds for the R subtree, which will generate 2'^ — I bit strings within its own subtree, 
and then move to La, completing a full cycle through all 2'^'^^ possible bit strings. 
Thus, we have a cychc binary code of dimension d + 1 and space efficiency 1. □ 



3.3 Generating the BRGC 

Lemma 3.2. The Binary Reflected Gray Gode of dimension d can he generated by 
a DAT, which requires reading d bits and writing at most 1 bits to generate each 
successive bit string. 

Proof. The proof is by induction. For a Binary Reflected Gray Code of dimension 
1, create a DAT with height 1. At the root node, the single bit is read. If the root 
reads a 0, move to its left child, if it reads a 1, move to its right child. The left child 
changes bit to 1 and the right child changes bit to be 0. This generates the cyclic 
BRGC of dimension 1. 

Let L be a DAT for the BRGC of dimension d, and let i? be a DAT which generates 



the same bit strings as L in reverse order. Then, by Lemma [3.1[ we can use L and 
R to construct a new DAT of dimension d + 1. We choose La to be the bit string 
000. ..0 and Lz to be the bit string 1000. ..0. Since the BRGC is cychc. La is the state 
which follows Lz- We choose Ra to be the state 1000. ..0 and Rz to be 000. ..0. 

Because Rz = La and Lz = Ra, no bits need to be changed to move between 
them. Thus, in the combined DAT, only the {d-\- l)-th bit needs to change in order to 
generate La from Rz or Ra from Lz, and the DAT is able to move between subtrees 
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with only one bit changed. Further, the first 2^^ states will correspond to a dimension 
d BRGC, with a in the {d + l)-th bit. The second 2*^ states will correspond to a 
dimension d BRGC in reverse order, with a 1 in the {d + l)-th bit. This is precisely 
the definition of the BRGC of dimension d + 1. Thus, we are able to construct a DAT 
that generates the BRGC of any dimension. □ 

An example of a DAT that generates the BRGC for dimension d = 3 can be seen 
in Figure |??[ 



Chapter 4 

Efficient generation of quasi-Gray 
codes 

In this chapter we address how to efficiently generate quasi-Gray codes of dimension 
d. We examine efficiency in terms of the number of bits read and written in the worst 

case to generate each successive bit string, the number of bits read on average to 
generate each successive bit string while generating the entire code, and the space 
efficiency. The codes we generate arc all cyclic. First we present DATs that read up 
to d bits in the worst case, but read fewer bits on average. Then we present our lazy 
counters that read at most 0{logd) bits in the worst case, while also reading fewer 
bits on average. 

4.1 Recursive Partition Gray Code (RPGC) 

We show a method for generating a cyclic Gray code of dimension d that requires 
reading an average of 61og(i bits to generate each successive bit string. First, assume 
that d is a power of two for simplicity. In this special case, we show that it reads 
on average no more than 4 log d bits to generate each bit string in the code. We use 
both an increment and decrement operation to generate the Gray code, where the 
increment operation generates the next bit string of dimension d in the code, and the 
decrement operation generates the previous bit string of dimension d in the code. 



18 
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Both increment and decrement operations are defined recursively, and they make 
use of each other. Pseudocode for these operations is provided in Algorithms [T] and [2j 
To generate the next bit string in the code, we partition the bit string of dimension n 
into two substrings, A and B, each of dimension n/2. We then recursively increment 
A unless A = B, that is, unless the bits in A are in the same state as the bits in B, 
at which point we recursively decrement B. Testing A = B is done by reading and 
comparing sequential pairs of bits in A and B until a pair is found that differ. In the 
analysis, we will see that this test reads only a constant number of bits on average. 

To generate the previous bit string in the code, we again partition the bit string of 
dimension n into two substrings, A and B, each of dimension n/2. We then recursively 
decrement A unless A = B + 1, that is, the bits of A are in the same state as the bits 
of B would be after an increment operation, at which time we recursively increment 
B instead. Testing A = B + 1 can be done by simulating an increment of B and 
testing A for equality against the result. In the analysis, we will see that this test 
also reads only a constant number of bits on average. 



Figure shows a conceptualization of the Recursive Partition Gray Code. Wheels 
A and B represent the states in part A and B of the code, respectively. When A 
is incremented, it moves to the next clockwise location, and when B is decremented 
it moves to the next counter-clockwise location. And inside each wheel A and B is 
another set of wheels. 

The increment and decrement operations must partition the bit strings identically; 
we assume w.l.o.g. that the A partition contains the first n/2 bits, and B contains 
the remaining n/2 bits. Pseudocode for the increment {RecurIncrementPow2) and 
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Figure 4.1: Conceptualization of the Recursive Partition Gray Code. A moves clock- 
wise while A B, at which point B moves counter-clockwise. 



decrement {RecurDecrementPow2) operations follows. 



Algorithm 1: RecurIncrementPow2 



Input: an array of n bits 
1 if n = 1 then 

if b[0] = 1 then b[0] ^ 0; 
else b[0] ^ 1; 



4 else 



Let A = b[0...n/2-l]; 
Let B = b[n/2...n-l]; 

it A = B then Recur DecrementPow2{B)] 
else RecurIncrementPow2{A); 



9 end 



Algorithm 2: RecuiDecrenieutP(n\ 



■9 



Input: an array of n bits 

1 if n = 1 then 

2 if b[0] = 1 then b[0] ^ 0; 

3 else b[0\ 1; 



4 else 



Let A = 6[0...n/2-l]; 
Let B = b\n/2...n-l]: 
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Lemma 4.1. The Recursive Partition Gray Code algorithm can be performed by a 
DAT to generate a Gray code of dimension d when d is a power of two. 

Proof. We will recursively define a DAT Ifi that performs the RecurIncrementPow2 
operation, and a DAT D^ that performs the RecurDecrementPow2 operation. 

Let Cd be a DAT that compares two bit strings of dimension d to determine if 
they differ in at least one bit. Each leaf of Cd represents a single bit i of the bit 
strings, such that i is the first bit seen which differs. Assume the DAT scans the bit 
strings for differences from bit to bit d — 1 in order. Then number the leaves such 
that leaf Cd,i is reached when bit i is the first bit seen that differs between the two bit 
strings. Leaf Cd^ is reached when there is no difference between the two bit strings. 

Let Md be a DAT that compares two bit strings of dimension d, to determine 
if they differ in at least two bits. As the root of M^, use a Cd DAT. At leaves Cd,i, 
< i < (i — 2, place another DAT Cd-i-i, that compares the last d — i — 1 bits between 
the two bit strings. Call the leaves of these trees Cd^ij when they are the j-th leaf of 
the subtree rooted at the i-th leaf of the top-level Cd DAT. Then some leaf Cd^ij, for 
< i < d — 2 and j < d — i — 1, will be reached when the two bit strings differ in at 
least two bits, while some leaf Cd,i,j, ioi i > d — 1 or j > d — i, will be reached when 
the two bit strings have less than two bits which differ. Call the first set of leaves the 
2-differ leaves, and the latter the 0-differ leaves. 

h and Di have a height of 1, and are each constructed identically. The root node 
reads the single bit and control transfers to a child. The leaf nodes flip the value of 
the single bit: if the bit was 1 it writes a 0, and if the bit was it writes a 1. 

To construct a DAT for Id, with d > 1, place a DAT Cd/2 as the root. At each 
leaf except Cd/2,d/2, put the root node of an Id/2 DAT. At the leaf Cd/2,d/2-, place the 
root node of a Dd/2 DAT. 

To construct a DAT for D^, with d > 1, place a DAT Md/2 as the root, which 
considers the entire bit string of dimension d. At each 0-differ leaf in the Md/2 DAT, 
place the root of a Id/2 DAT, which simulates an increment operation on the last d/2 
bits, with each leaf being the root of a Cd/2 DAT. At each leaf of the Cd/2 DATs, 
place the root of a Dd/2 DAT that operates also on the last d/2 bits, with exception 
that the highest rank leaf in each Cd/2 DAT is the root of a Id/2 DAT that operates 
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on the first d/2 bits instead. Finally, at each 2-differ leaf node, place a 0,^/2 DAT 
that operates on the last d/2 bits. 

At the end of this process, look at each path from the root to a leaf in the final 
DAT. Let i and j be nodes on any such path such that i and j both read the same bit 
of the input string. Assume without loss of generality that j is in the left subtree of 
i. Then remove j and its right subtree from the DAT, replacing j with its left child. 
Continue this process until there are no such i and j in the DAT. □ 

Lemma 4.2. For a dimension d > 1, the RecurIncrementPow2 algorithm generates 
a Gray code of dimension d with length 2'^. 

Proof. The proof is by induction on d. 

Let d — 1, then the RecurIncrementPow2 algorithm flips the bit twice, creating 
a Gray code of length 2 = 2^ = 2*^. The same is true for the RecurDecrementPow2 
algorithm. 

Then assume it is true for dimensions less than d. We will show that for dimension 
d, the RecurIncrementPow2 algorithm generates a Gray code of length 2^^. 
Let |X| be the dimension of a bit string X. 

If d is even, and A = B initially, then the algorithm starts by decrementing B. 
For each decrement of B, the algorithm will increment A 2''^^ — 1 times in order to 
make them equal again. This comes from the fact that \A\ = d/2 < d and thus the 
bits move through 2^^/^ unique states. 

B is decremented 2'^^'^ times before it reaches its initial state again, since |-B| = 
d/2 < d. Since only one of A and B is changed at each step, the total number of bit 
strings generated is equal to the number of times B is decremented plus the number 
of times A is incremented. This is 2l^l + 2l^l • (2l^l - 1) = 2'^/^ + 2'^/^{2'^/^ - 1) = 
2d/2 _^ - 2^*/^ = 2^*. The same holds for RecurDecrementPow2 by a symmetric 
argument. 

At each step if the algorithm, bits are read but not written, except in the base 
case, where d — 1. Each generating step recurses on one sub problem, thus only one 
bit is written, and the resulting code is a Gray code. 
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Therefore, the RecurIncrementPow2 algorithm generates a Gray code of dimen- 
sion d with length L = 2'^. □ 

Theorem 4.3. Let d > 2 be a power of two. There exists a DAT that generates a 
Gray code of dimension d and length L = 2'^, where generating the next bit string 
requires reading on average no more than Alogd bits of the current string. In the 
worst case, d bits are read, and only 1 bit is written. 



Proof. By Lemma |4.1[ we can construct a DAT that performs the Recurlncrement- 
Pow2 and RecurDecrementPow2 operations and generates a Recursive Partition Gray 
code of dimension d. 

Since the RPGC has length L = 2'^, the algorithm will be executed once for each 
possible bit string of dimension d. Based on this observation, we bound the average 
number of bits read by studying the expected number of bits read given a random bit 
string of dimension d. The proof is by induction on d. For the base case d = 2, in the 
worst case we read at most 2 bits, so the average number of bits read in a random 
bit string is at most 2 < 4 log d. Then we assume our claim is true for a random bit 
string of dimension d/2. 

We define |X| to denote the dimension of a bit string X. Let C{A,B) be the 
number of bits read to determine whether or not A = B, where A and B are bit 
strings and \A\ = \B\. Let I{X) be the number of bits read to increment the bit 
string X. Let D{X) be the number of bits read to decrement the bit string X. Note 
that since we are working in the DAT model, we read any bit at most one time, and 
D{X) < \X\. 

To finish the proof, we need to show that E[/(X)] < Alogd, when X is a random 
bit string of dimension d. 

We can determine the expected value of C{A,B) as follows. C{A,B) must read 
two bits at a time, one from each of A and B, and compare them, only until it finds 
a pair that differs. Given two random bit strings, the probability that bit i is the 
first bit that differs between the two strings is 1/2*^^. If the two strings differ in bit 
i, then the function will read exactly i + 1 bits in each string. If \A\ = \B\ = n/2, 
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then the expected value of C{A, B) is 

i=Q i=l \ / ^ / 

Let X = AB, and \A\ = \B\ = d/2. Then |X| = d. 

For a predicate P. wc define Ip to be the indicator random variable whose value 
is 1 when P is true, and otherwise. 

Note that I{A) is independent of 1a=b and Iaj^b- This is because the relation 
between A and B has no effect on the distribution of A (which remains uniformly 
distributed among all bit strings of dimension d/2). The same is true of /(-B), D{A) 
and D{B). 

The Recur IncrementPow2 operation only performs one increment or decrement 
action, depending on the condition A — B, thus the expected number of bits read by 
I(X) is 

E[I{X)] = E[C{A, B)] + E[1a=bD{B)] + E[1^^b/(A)] 

< 4 + 4 log (i - 4 log 2 
= 4 log d , 

as required. 

Last, in the worst case when A = B, the comparison test will require reading all 
bits in A and B, and thus all d bits in the code. □ 

Now consider the case when d is not a power of two. When the increment operation 
is given a bit string with an even number of bits, the algorithm is the same as in the 
power of two case. However, when it is given a bit string with an odd number of 
bits, it uses the first bit as a direction bit. While the direction bit is 0, the increment 
operation will recursively increment the remaining bits, until they reach their state 
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of highest rank. At that point, the increment operation flips the direction bit to 1. 
Prom then on, while the direction bit is 1, the increment operation will recursively 
decrement the remaining bits, until they reach their state of minimum rank. Then 
the increment operation would flip the direction bit back to 0, reaching its initial 
state. This generates all possible states for the bit string. 

The decrement operation functions similarly to the increment operation when 
given a bit string with an odd number of bits, except that it decrements the remaining 
bits when the direction bit is and increments them when the direction bit is 1. 
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Pseudocode for the increment (Recurlncrement) and decrement {RecurDecre- 
ment) operations for the more general scenario follows. 
Algorithm 3: Recurlncrement 
Input: an array of n bits 
1 if n = 1 then 

if 6[0] = 1 then 6[0] ^ 0; 
else 6[0] ^ 1; 



4 else 



5 
6 
7 

8 
9 
10 

11 
12 
13 
14 
15 
16 
17 
18 
19 



if n mod 2 — 1 then 
Let W ^h[l...n-l]] 
if 6[0] = then 

// Is W in its maximal ranked state 1000. ..0? 
if rank{W) = 2""^ - 1 then 6[0] ^ 1; 
else Recur IncrementiW)] 
else 

// Is ly in its minimal ranked state 000. ..0? 
if rank{W) = then b[0] ^ 0; 
else Recur DecrementiW); 
end 
else 

Let A = b[0...n/2 - 1]; 
Let B = b[n/2...n - 1]; 
it A = B then Recur D ear ement{B); 
else Recur Increment{A); 
end 



20 end 



Algorithm 4: Compareinc 



Input: AW, an array of n bits; an array of n bits 

// Returns true if incrementing B would put the bits of B in the 

same state as the bits in A 
1 if n = 1 then 



return A[0] 7^ B[0]; 



3 else 



Let Ai = A[0...n/2 - 1]; 
Let A2 = A[n/2...n - 1]; 



/o 



11. 
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Lemma 4.4. For a dimension d > 1, the Recursive Partition Gray Code algorithm 
generates a Gray code of dimension d with length 2'^. 

Proof. The proof is by induction on d. 

Let d = 1, then the Recur Increment algorithm flips the bit twice, creating a Gray 
code of length 2 = 2^ = 2'^. The same is true for the RecurDecrement algorithm. 

Then assume it is true for dimensions less than d. We will show that for dimension 
d, the Recurlncrement algorithm generates a Gray code of length 2*^. 

If d is even, and A = B initially, then the algorithm starts by decrementing B. 
For each decrement of B, the algorithm will increment A 2'^^'^ — 1 times in order to 
make them equal again. This comes from the fact that \A\ = d/2 < d and thus the 
bits move through 2"^/^ unique states. 

B is decremented 2^^^^ times before it reaches its initial state again, since \B\ = 
d/2 < d. Since only one of A and B is changed at each step, the total number of bit 
strings generated is equal to the number of times B is decremented plus the number 
of times A is incremented. This is 2l^l + 2l^l • (2l^l - 1) = 2'^/^ + 2'^''^{2'^l'^ - 1) = 
2<i/2 _|_ 2<^ _ 2"^/^ = 2'^. The same holds for RecurDecrement by a symmetric argument. 

If d is odd, then the algorithm will increment the last d — 1 bits until they reach 
their maximally ranked state. Since d — l<d, this will generate 2'^~^ — 1 bit strings. 
Next, the algorithm will flip the flrst bit, one bit string, and decrement the last d — 1 
bits until they reach their initial state. This generates another 2'^~^ — 1 bit strings. 
Finally, it flips the flrst bit back to its initial state as well. In total, this generates 
2<^-i _ 1 + 1 + 2'^'^ - 1 + 1 = 2*^ bit strings. The RecurDecrement algorithm performs 
the same operations in a different order, producing the same number of bit strings. 

At each step if the algorithm, bits are read but not written, except in the base 
case, where d—1. Each generating step recurses on one sub problem, thus only one 
bit is written, and the resulting code is a Gray code. 

Therefore, the Recursive Partition Gray Code algorithm generates a Gray code of 
dimension d with length L = 2'^. □ 

Theorem 4.5. Let d>2. There exists a DAT that generates a Gray code of dimen- 
sion d and length L — 2^ , where generating the next bit string requires reading on 
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average no more than 6 log d bits of the current string. Only 1 bit is written in the 
worst case, and d bits are read. 



Proof. From Lemma |4.4[ we know the length of a RPGC of dimension d is exactly 



2 . Then, as in the proof of Theorem 4.3, the algorithm will be executed once for 



each possible bit string of dimension d. As such, we bound the average number of 
bits read by studying the expected number of bits read given a random bit string 
of dimension d. The proof is by induction on d. We will show that the expected 
number of bits read to increment is at most 5.623 log rf. In some cases, incrementing 
requires decrementing a substring of the code, and we will show that in these cases, 
the expected number of bits read by a decrementing step is at most 7.746 log bits. 

For the base case d = 2, in the worst case we read at most 2 bits to perform 
Recurlncrement, so the average bits read is at most 2 < 5.623 log (i. Similarly to 
perform RecurD cerement, we read at most 2 bits, and thus read on average at most 
2 < 7.746 log d. 

Then we assume that these both hold for a random bit string of dimension less 
than d. Let I{X) and D{X) be the number of bits read to increment or decrement the 
bit string X, respectively. To finish the proof, we will show that E[/(X)] < 5.623 log d, 
when X is a random bit string of dimension d. 

We define |X| to denote the dimension of the bit string X, and X~* refers to 
a substring of X with dimension |X| — t. Let C{A,B) be the number of bits read 
to determine whether or not A = B, where A and B are bit strings and |y4| = 
\B\ = n/2. Let M{A,B) be the number of bits read by the Compareinc algorithm, 
which determines whether or not A = B + 1, where A and B are bit strings and 
|y4| = \B\ = n/2. Note that since we are working in the DAT model, we read any bit 
at most one time. And finally, let / = 3.5 and D = 7.746. 



The expected value of C{A, B) is given in the proof of Theorem 4.3|as E[C(A, B)] 



2 + n/2 

2(2 ^^-^^ — j , and we can determine the expected value of M{A, B) as follows. 

M{A, B) compares two bit strings against a third, all of equal dimension, n/4. There 
are four possible outcomes for the Compareinc function. We examine the cost and 
probability of each to determine the expected value of M{A, B). 
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When Bi = B2 ^ Ai, Com,pareInc returns false. In this case Bi is compared 
against B2 and Ai. Since Bi — B2, we read all n/2 bits in Bi and B2. Since Ai 

differs from the others, we expect to read ^ 2^ ^ ^ — ^^^^ 

•4 = 1 

same argument as made for C{A, B). Thus we read at most 2 + n/2 bits on average 
in this case, which happens with probability — = — = 

2n/4 _ I 



2n/2 

When Bi 7^ B2 and B2 7^ A2, Compareinc returns false again. In this case B2 is 

compared against Bi and against A2. If the two comparisons read the bits of B2 in 

the same order, the second comparison will read many of the same bits as the first. In 

the DAT model, this means we can count far fewer bits read on average for the second 

2 + n/4: 

comparison. The expected number of bits read in each of Bi and is 2 — . 

The number of bits read in B2 is the max of the number of bits read in Bi and A2. 
Consider an equivalent problem. Given two random bit strings of dimension k, if 
we examine bit positions in both strings together, count the number of positions we 
expect to look at to find at least one 1 in each string. The probability that position 
i, 1 < i < k, is the first position where we have seen a 1 at some position j < i in 

22(fc— i)/'2i 2^2{k—i) Q^i 

each string is 2 — — — = 2—--. — . The first term counts the 

° 2^^ 2^* 2^* 

number of strings with the first i — 1 bits set to and the i-th bit set to 1 for one 
string, and at least one 1 somewhere in the first i bits of the other. It is multiplied 
by two, since this can occur with a 1 in position i for either of the two strings. The 
second term keeps us from double counting the case where the first 1 in both strings 
occurs at position i. The expected number of bit positions read is then 

i=l i=l ^ ^ 

which is the expected number of bits read in S2. The above summation converges to 
8/3 in the limit i — > 00, and so we can upper bound it by this value. Thus, in this 
case, the average number of bits we read is at most 20/3. This scenario occurs with 

/'2n/4'\2/2n/4 _ -|^\2 2"/^ _ 21+"/^ _|_ \ 2^+'*/'* — 1 

probability = ——rz = 1 ——rz . 

In the other two cases, Compareinc recurses on a problem of size n/2. Since there 
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are n bits in the input to the M{A, B), we can bound the average number of bits read 
in this case by n, which happens with probabihty 1 = 

1 

2"/4 ■ 

By combining these four cases, the expected value of M{A, B) is 

/2n/4_i\ / 2^+"/''-l\ / 1 \ 

m{A, B)] = [-^^) (2 + n/2) -^^) (20/3) + j n < 7.507 , 

for n > 2. This bound comes from maximizing the function, where according to 
Maple, the function has a maximum value fa 7.506883004, for n > 2. 

For a predicate P, we define Ip to be the indicator random variable whose value 
is 1 when P is true, and otherwise. 

Let A and B be two random bit strings of equal dimension. Note that I (A), 
I{B), D{A), and D[B) are all independent of 1a=b and 1a=b+i- This is because the 
relation between A and B has no effect on the distribution of A or 5 (which each 
remain uniformly distributed among all bit strings of their dimension). 

Throughout the following cases, we assume that the first bit of A or 5, if we 
rccursc on A or respectively, will always be read in the next recursive step, so we 
arc able to read the bit for free in the current step. This is true, since wc count 
the first bit explicitly in each inductive step, and the base case reads all of its bits, 
including the first. 

Let X be an input string of dimension d. Wc consider the cases when d is an even 
or odd number separately. First, assume that d is an even number. 

The Recurlncrement operation only performs one increment or decrement action, 
depending on the condition A ~ B. We bound the cost of D{B) here hj d/2 which 
is an upper bound for the DAT model. The comparison in C{A, B) will read the first 
bit of A and B, so at least one bit will be double counted, which we can subtract. 
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Thus the expected number of bits read by I{X), when \X\ is even, is 
E[I(X)] = E[C(A, B)]-l + E[1a=bD(B)] + E[1a^bI{A)] 

< 3 + /log(ci/2) 

because / = 3.5 > 3. 

We also examine the cost of the RecurDecrement operation, as we will need it for 
the case when d is odd. The RecurDecrement operation performs one decrement or 
increment action, depending on the condition A = B + 1. We make sure that we 
count both the first bit of A and B so that we can subtract one bit as being double 
counted. The first bit of B is always read by M{A, B), but the first bit of A is not 
read when Bi ^ i?2- We bound the cost of 1{B^ here by (i/2, giving us the expected 
number of bits read by -D(X), when |X| is even, as 

E\D{X)\ = E\M{A, B)\ + E[lsi^B.l] - 1 + E\\a=b^xI{B)\ + E\\a+b^xB){A)\ 

^ 7.507+ ^-l+^ + E[D(^)] 

< 6.507+ ^j;/ +E[D{A)] 

< 7.715 + L>log(d/2) 

< Dlogd , 

because D = 7.746 > 7.715. Elementary algebraic manipulation shows that ^^^^ 

V2 + 1 

is at most < 1.208. 

2 

Now consider the case when d is an odd number. Let X — xAB, where |x| = 1 
and \A\ = \B\ = {d — l)/2. We assume that the bit x can be read for free, which we 
will later show is the case. 

When X = 0, the Recurlncrement operation checks to see that AB ^ 100. ..0, and 
increments AB when it is true, otherwise it stops and reads no more bits. Let and 
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bj be the i-th and j-th bits of A and B, respectively. We assume that the comparison 
of AB to the bit string 100. ..0 is done by looking at the bits of A and of B in an 
alternating sequence, such that the first bit read is ao, then bo, followed by oi and bi, 
and so on. 

Consider the four cases for the possible values of ao and bo, the first bits read. 

Case ao — 0, bo — 0: Then we know AB ^ 100. ..0 and AB will be incremented. 
The first step of incrementing AB is to test A — B. This test will read the first bit of 
A and B, and so at least one bit is double-counted and can be subtracted here. Thus, 
in this case, the expected number of bits read will be at most 2 — 1-|-E[C(74~^, B~^')\^ 
E[l^-i^B-i/(A)] + E[l^-i=B-i^^(5)] = 1 + 4 - 2 ^^^ff + E[1a-i^b-i/(^)] + 

E[l^-i=s-ii^(i?)] = 5 - + E[l^-VB-i/(A)] + E[l^-i=B-i^(5)]. 

Case ao = 1, &o = 1- This case has the same analysis and expected value as the 
previous case. 

Case ao — \, bo — 0: Then we must continue checking if AB — 100. ..0. This 
will read an expected 2 — (2 + (o? — 2))2~('^~^^ bits. We already know the result of 
the test A — B from the bits ao and bo, so no more bits need to be read in order 
to proceed. And we can subtract one bit as we double count the first bit of A 
or B once again. Thus, in this case, the expected number of bits read will be at 
most 2 - 1 + 2 - ^ ""^^^J^ + E[l^-i^B-i/(^)] + E[l^-i=B-i^(S)] = 3 - ^ + 
E[l^-i^B-i/(A)] + E[l^-i=B-ii^(S)]. 

Case ao = 0, 6o = 1= Then we know AB ^ 100. ..0 and also that A^ B, and we 
do not read any more bits to continue to the next recursive step. We have also double 
counted the first bit of A or S and can subtract it. Thus, in this case, the expected 
number of bits read will be at most 2 — 1 + E[l^-i^s-i/(74)] + E[1a-i=b-i-D(-B)] = 
1 + E[l^-i^B-i/(A)] + E[l^-i=B-i^(fi)]. 

We bound the number of bits read for D{B) by ri/2, as this is an upper bound for 
the DAT model. Because each of these cases occurs once out of every four executions. 



CHAPTER 4. EFFICIENT GENERATION OF QUASI-GRAY CODES 



33 



the total expected value of /(X) when \X\ is odd and x = is 

+ E[1a^bI{A)] + E[1a=bD{B)] 
14 3 + d/2 d d/2 

< ^ + /log(ci/2) 
= I^ogd , 

because 7 = 3.5 = 14/4. 

Next we consider when x = 1. In this scenario, the Recur Increment operation 
checks to see that AB ^ 000. ..0, and decrements AB when it is true, otherwise it 
stops and reads no more bits. Let and bj be the i-th and j-th bits of A and B, 
respectively. In this case, wc do the comparison of AB to the bit string 000. ..0 in a 
different order. Wc first look at the bit bo, then bn/2, followed by bi and &n/2+i- 

We 

continue reading B in this alternating order until all bits have been read. Then we 
check the bits a„/2, cin/2+1, up to fln-i, and finally ao, ai, up to a„/2-i. 

Consider the four cases for the possible values of bo and bn/2, the first bits read. 
And note that M{A, B) always reads &o ^-nd 6„/2 so we do not count them outside of 
that function. 

Case 60 — I5 bn/2 — 0: Then we know that AB ^ 000. ..0. The decrement 
operation requires checking that A ^ B -\- 1, using the CompareInc function. This 
number of bits read by this function is M{A,B). We also count the first bit of A 
if Bi ^ B2 and it is not read by M{A, B), so that it is double-counted and we can 
subtract it. Thus, in this case, the expected number of bits read will be at most 
E[M{A, B)\ + E[1b,^b, 1] - 1 + E[1a^b+iD{A)\ +E[1a=b+iI{B)\ < 7.507 + ^ - 1 + 
E[1a^b+iD{A)]+E[1a=b+iI{B)] < 7.215 + E[1a^b+iD{A)] +E[1a=b+iI{B)]. This 
follows from the fact that is a decreasing function for d > 2, and its maximum 
value, at d = 2, is 2"^/^ < 0.708. 

Case bo = 0, bn/2 = 1: This case has the same analysis and expected value as the 
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previous case. 

Case bo — 1, hn/2 — 1: This case also has the same analysis and expected value 
as the previous two cases. 

Case 6o = 0, hn/2 — 0: Then we need to continue checking if AB ^ 000... 0, and 
if it is true, AB will be decremented. We note that this check reads bits in the same 
order as M{A, B) compares the bit strings A and B. While the check continues to find 
all bits, the function CompareInc would also find equality, so we know that it would 
read at least as many bits as we check to find a bit set to 1. This holds until 3n/4 bits 
have been read, which is the most that M{A, B) will read. When these bits are all 
0, we must continue reading the last n/4 bits to compare them against 000. ..0. This 

happens for 2"^^ of the possible 2" inputs, thus with probabihty 2"^"*^^. To compare 

2 -\- n/A 

the last n/4 bits against 000. ..0, we expect to read 2 — bits. We also make sure 

the first bit of A is double-counted, as when Bi ^ B2, it will not be read by M{A, B) 

or by the comparison to 000... 0. Thus, in this case, the expected number of bits read 

will be at most E[M(A, i?)]+E[l5,^B,l]-l + ^ {2 - +E[U^B+iDiA)] + 

1 2 2 + d/A 
E[U=B+im] < 7.507+^-1+^ ^+E[U^B+iD{A)]+E[U=B+iI{B)]. 

We bound the number of bits read for I{B) by d/2, as this is an upper bound for 

the DAT model. If \A\ is even, we can bound the expected cost of D{A) as shown 

above. When |A| is odd, then D{A) will have the same expected number of bits read 

as I (A), since they both increment half the time, and decrement half the time. Thus, 

in both cases, we are able to bound E[D(74)] by 7.746 \og{d/2) by induction. Because 

each of these cases occurs once out of every four executions, the total expected value 
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of I{X) when \X\ is odd and x = 1 is 

E[4=i(X)] < ^ (7.215) + ^ f 7.507 - 1 + -ij + ^ ^ + 



+ E[1a^b+iD{A)] + E[1a=b+i/(5)] 

1/1 2 2 + rf/4 2rf \ 

< 7.038 + + E[L'(^)] 

< 7.746 + Dlog(d/2) 
= Dlogd , 

1 2 2 + (i/4 

because D = 7.746. This follows from maximizing the function ^ 1- 

4 • d/2 

^^1^ , which according to Maple, has a maximum value ~ 2.831931894 for d>2. 

Last, we must consider the bit x. While generating the entire Gray code, the bit 
X will be for half of the time, and 1 the other half. Thus /^=o(^) will execute half 
of the time when |X| is odd, and lx=\{X^ will execute the other half. Therefore, the 
expected value of when \X\ is odd is 

E[7(X)] < E[l,=o4=oW] +E[l,=i4=i(X)] 

< ^(/log(i) + ^ (Dlogd) 

= ^(3.51ogd) + ^(7.746 log d) 

< 5.623 log d , 

because I — 3.5 and D — 7.746. Prom our initial choice of I and £), all of the 
above holds, and we have shown that in all cases, for a bit string X of dimension d, 
E[/(X)] < 5.623 log d. 

This satisfies the inductive proof, however, we did not count the cost of reading 
the bit X when the input string is odd. Note that when we recurse on a substring A 
or 5 of X, we have compared A = B (to increment) or A = + 1 (to decrement), 
each of which requires reading at least one bit of A and B. Since this bit will be x 
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when A or i? is odd, we will have always already read the bit x during the previous 
recursive step. Therefore, the bit x can be read for free whenever we are inside a 
recursive call. Only at the first level of recursion must we count reading the bit x. 
For any d, the algorithm's recursion eventually reaches the base case oi d — 2. In this 
case, the worst case number of bits read will be 2, but we count it as 6 log d — 6. This 
leaves 6 — 2 = 4 bits counted in our average that we did not actually read. These bits 
more than account for the one bit x at the top level of the recursion. 

Therefore, to generate the next bit string in the Recursive Partition Gray Code, 
given an input string X, the Recurlncrement algorithm will read on average E[7(X)] < 
6 log d bits. □ 



4.2 Composite code construction 

In this section, we show a method to take a code with space efficiency 1 that reads on 
average r bits, and construct a new, larger, code that reads on average O(loglogr) 
bits, while maintaining the same space efficiency, but increasing the worst-case num- 
ber of bits written by one. 

Lemma 4.6. Let d > 1, r > 3, p > 1, w > 1 be integers. Assume we have a DAT 
for a code of dimension d, that generates L — 2'^ hit strings, such that the following 
holds: Given a hit string of length d, generating the next hit string in the code requires 
reading no more than r hits on average, reads p hits in the worst case, and writes at 
most w hits in the worst case. 

Then there is a space-optimal DAT for a code of dimension d-\- [log r] , where gener- 
ating each hit string from the previous one requires reading no more than 6 log ("log r] -|- 
3 hits on average, reading p+ [logr] hits in the worst case, and writing at most w + 1 
hits. That is, the average number of hits read decreases from r to O(loglogr), while 
the worst-case number of bits written increases by one. 

Proof. We are given a DAT A that generates a code of dimension d. The DAT, 
A, requires reading on average no more than r bits, and writing at most w bits to 
generate each bit string. We construct a DAT B for the Recursive Partition Gray 
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Code of dimension d' , such as described in Section 4.1 The DAT, B, requires reading 
< 6 log d' bits on average to generate the next state, and requires writing only 1 bit 
in the worst case. 

We construct a new DAT from the two DATs of dimension d and d' . The combined 
DAT generates bit strings of dimension d + d'. The last d' bits of the combined code, 
when updated, will cycle through the code generated by B. The first d bits, when 
updated, will cycle through the code generated by A. 

The DAT initially moves the last d' bits through 2°^ states according to the rules 
of B. When it leaves this final state, to generate the initial bit string of B again, the 
DAT also moves the first d bits to their next state according to the rules of A. 



Figure 4^ shows a conceptualization of the Composite code. Wheels A and B 
represent the states in part A and B of the code, respectively. B moves clockwise 
around its wheel with each increment operation. When B moves from its highest 
state back to 0, then A also moves one step in the clockwise direction. 

During each generating step, the last d' bits are considered and moved to their 
next state in the code generated by the rules of B, and checked to see if they have 
reached their initial position. The first d bits are only incremented when the last d' 
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bits cycle back to their initial state - once for every 2'^ bit strings generated by the 
combined DAT. 

Incrementing the last d' bits requires < 6 log d' bits to be read on average and 1 

bit to be written. Checking if the d'-code has reached its initial state is equivalent 

to comparing it against a fixed bit string (that is the code's initial state). This is 

done an equal number of times for all possible bit strings of dimension d', and so the 

average number of bits read is equal to the expected position of the left-most 1 in a 

2<i'+^ - 2-d' 

random bit string of dimension d which is equal to < 2. 

If we let d' = [logr], then the RPGC B has dimension at least 2, from our 
restriction on r. The average read cost of the combined code to generate each bit 
string in the sequence becomes no more than the average cost of the d'-code, plus the 
cost to check if the code reached its initial state, plus the cost of d-code divided by 
the length of the d'-code. 

Therefore, the average number of bits read by the combined code is at most 

61ogd' + 2 + ^ =61ogriogr1 + 2+^^ 

t> to I to I I I 2 [log r] 

< 6 log [log r] + 3 . 

In the case that both A and B are incremented, the bits of B are equal to its 
initial state, so checking this requires reading all d' bits. This happens for all possible 
values in A, which requires reading p bits in the worst case. Therefore, at some time, 
the new code will read p + d' = p + [log r] bits. 

The number of writes in the worst case, is the number of bits that must change 
to update both the code contained in the first d bits, and the code contained in the 
last d' bits. Thus the number of bits written per increment operation is at most 
(bits written by d-code) + (bits written by d'-code) < w + 1. □ 

4.3 RPGC-Composite Code 



We are able to use the RPGC from Theorem 4^ with our Composite code from Lemma 
4.6 to construct a new space-optimal DAT that generates a code. By applying Lemma 
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4.6 to the RPGC, and then repeatedly applying it c — 1 more times to the resulting 
DAT, we create a DAT that generates a code while reading on average no more than 
6 log'-^'^"^^ d+ 11 bits, and never writing more than c bits to generate each bit string, 
for any constant c > 1. 



We use the following lemmas to prove Theorem 4.10 



Lemma 4.7. Given r > 16, then 61og(logr + 1) < 6 log log r + 2. 

Proof. Let x = logr. Then we want to prove that given x > A, 61og(a; + 1) < 
61oga: + 2. It is equivalent to show that log(l + 1/a;) <log(l + l/4) < 1/3. Therefore 
61og(logr + 1) < 61oglogr + 2. □ 

Lemma 4.8. Let c> 1 and d be integers such that log'-'^^"'^^ d> 2. Then log^^\d/2) > 
log^'^^d- 1. 

Proof. The proof is by induction. Let c = 1. Then log((i/2) = logrf — 1. 

Assume the claim is true for c, we will show that it is then true for c + 1. From 
our choice of c and d, we know that 

log(") d>2 
21og(")rf-2 > log'-'U 
log(2(log(") d-l))> log(log(") d) 
1 + log(log(') d-l)> log^'^^^ d 
log(log(") d-l)> log("+^) d - 1 

log(log^'^^((i/2)) > log*-'^"^^-' d — 1 (from our inductive hypothesis) 
log('=+^)(rf/2) >\og^'+^U-l , 

as required. □ 

Lemma 4.9. Let d and c > 1 be integers such that log^'^'^^^^ d > 11, and let m = 
d - [log(61og(^'=-^) d + 11)1 . Then log^^""^) m > 11. 

Proof. Given that log*-^^"^^-* d > 11, then we can say that log log (i + 4 < d/2 because 
d > 16. By our choice of d, we know that logrf > 11 and thus d > 2^^ > 16. Using 
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this, we can show that m > d/2 a.s follows: 

m = d- \\og{Q log(2^-^) d + 11)] 

> rf-log(6W2'="^^rf+ 11) - 1 
>d- log(6 log^^'"^) d + \og^^'-^^ d) - 1 
= rf-log(71og(^"-^) d)-l 

= d-log(2^)d-log7-l 
>d- (log(2'=) d + A) 
>d - (log log d + A) 

> d/2 . 

By our choice of c and d we know that log^^'^"^^ c? = log(log*-^'^~^'' c?) > 11 and 



log(^'-2) d >2^^ > 2. Then, since m > d/2, and with Lemma |4.8[ it follows that 
log(2^~^) m > log(2^-^)(t;/2) > log(2^-^) d-l> 2^" - 1 > 11. □ 

Theorem 4.10. Given integers d and c > 1, such that log*-^^"^-* d > 11. There exists 
a DAT of dimension d that generates a code of length L = 2'^ , where generating the 
next hit string requires reading on average no more than 6 log^^'^""'^'' c? + 11 hits and 
writing in the worst case at most c hits. 

Proof. The proof is by induction on c. Let c = 1. In this case, the Recursive 



Partition Gray Code in Section 4J^ satisfies the requirements. From our choice of 
d, the RPGC will have dimension at least two. It requires reading no more than 
61ogd < eiogWrf + 11 bits on average, and writes 1 bit in each generating step. 

Then let c > 1, and assume that the theorem is true for c. We will show it is true 
for c + 1. 

Let d be such that log^^"^"*"^^ d > 11, and define m = d - [log(6 log^^''"^^ d + 11)]. 



Then from Lemma 4^ we know that log*-^'^"^'' m > 11. Thus we assume our initial 
claim holds true for c, and there exists a DAT M of dimension m that requires reading 
on average no more than 6 log*-^'^"^'' m + 11 < 6 log*-^'^^^-' c? + 11 bits, and writing at 
most c bits for each transition. 



We use the construction from Lemma 4.6 to perform the inductive step and con- 



struct a new DAT. As input to Lemma 4.6, we use our DAT M, which gives us 
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r = 6 log*-^^"^-* (i + 11 > 3 and w = c. This produces a new composite DAT of di- 
mension m + [logr] = m + [log(61og''^'^~^^ c/ + 11)] = d that reads no more than 
6 log [log r] + 3 bits. Note that from our choice of c and d, log? < log 11 < log^'^'^^d. 
Then our new DAT will read on average no more than 

6 log [logr] + 3 < 61og(logr + 1) + 3 



< 6 log logr + 5 (from Lemma 4.7) 
= 6 loglog(6 log^^"^-^) + 11) + 5 

< 6 log log(6 log(^"-^) d + log(2"-^) d) + 5 
= 61oglog(7W2"-^) d) + 5 

= 6 log(log 7 + log(2') d) + 5 

< 6 log(log(2'=) d + log(2") d) + 5 
= 6 log(2 log(^'^ d) + 5 

= 61og2 + 61og(^'=+^^ d) + 5 
= 61og(2^+^) d) + ll 



bits, as required. 



From Lemma 4^, in the worst case, our DAT writes at most w + 1 = c + 1 bits 
to generate the next bit string. These satisfy our initial claim, finishing the proof. 

Thus, there exists a DAT for any c > 1, when log'-^'^"^^ > 11, that generates all 
2'^ bit strings of a code of dimension d, and requires reading on average no more than 
61og''^'^~^^ d+11 bits, and writing at most c bits to generate each successive bit string 
from the previous string. □ 

4.4 Reading a constant average number of bits 



We have shown in Section |4.3| that it is possible to construct a code that requires 
reading, on average, at most Glog*-^^"^-* d + 11 bits, and never requires writing more 
than c bits, for any constant c > 1. This allows reading a small number of bits while 
writing at most a constant amount. 
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From Theorem 4.10 by taking c to be a function of log* d, it immediately follows 
that we can create a DAT that generates all 2'^ bit strings of dimension d, for which 
each generating step requires reading a constant number of bits on average. This 
is a trade off, as the DAT requires writing at most 0(log* d) bits in the worst case, 
meaning the code generated by this DAT is not considered a quasi-Gray code. 

Corollary 4.11. There exists a space-optimal DAT of dimension d which has the 
following properties for each generating step: 

1. For any d > 2^^, the DAT reads no more than 6 ■ 2"^^^ + 11 bits on average, and 
writes no more than [(log* d — 1)/2J bits in the worst case. 

2. For any c? > 3 + 2^^, the DAT reads no more than 100 bits on average, and 
writes no more than [(log* + l)/2j bits in the worst case. 

3. For any d > 10 + 2^'^ , the DAT reads no more than 20 bits on average, and 
writes no more than [(log* (i + 3)/2j bits in the worst case. 

4- For any d > 15 + 2F , the DAT reads no more than 17 bits on average, and 
writes no more than [(log* d + 5)/2j bits in the worst case. 

Proof Let d > 2^^ and c = [(log* - 3)/2j. Then c > 1 and log^^'^"^) > 11 and 
it follows from Theorem 4.10| that there exists a space-optimal DAT of dimension d, 



which has the following properties for each generating step: In the average case, it 
requires reading no more than 6 log(2L{iog*rf-3)/2j-i) ^^^^ ^ g j^g(2{(iog' d-3)/2-i/2)-i) 

11 < 6 log(^°s* 11 < 6 ■ 2^'' + 11 bits. And in the worst case, it requires writing 

at most [(log*rf- 1)/2J bits. 

Let d > 3 + 2^\ andm = d-{3 + 2^^) > 2^^. From our previous statement, we can 
construct a DAT of dimension m, which requires reading at most 6 ■ 2^^*^ + 11 < 2^~^^^^' 



bits on average, and writing no more than 
worst case. Use this DAT with Lemma 



4.6 



_(log*m-l)/2j < [(log*rf-l)/2j bits in the 
, setting r = 2^+^'" and w = [(log* d-l)/2\ . 
Then there exists a DAT of dimension m + [log r] = m + 3 + 2^^ = d, that requires 
reading on average no more than 6 log [log r] + 3 < 100 bits to generate the next bit 
string in the code, and writes no more than [(log* d + l)/2j bits in the worst case. 
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Let d > 10 + 2^^, and m = d — 7>3 + 2^"^ . Use the DAT from our previous 
statement with Lemma 4.6, setting r = 100 and w = [(log* + 1)/2J. Then there 
exists a DAT of dimension m + [log r] = m + 7 = d, which requires reading on average 
no more than 6 log [log r] + 3 < 20 bits to generate the next bit string in the code, 
and writes no more than [(log* d + 3)/2j bits in the worst case. 

Let d > 15 + 2^'', and m = d — 5> 10 + 2^^. Use the DAT from our previous 



statement with Lemma 4.6 , setting r = 20 and w = [(log* d+3)/2\ . Then there exists 
a DAT of dimension m + [log r] = m + 5 = d, which requires reading on average no 
more than 6 log [log r] + 3 < 17 bits to generate the next bit string in the code, and 
writes no more than [(log* d + 5)/2j bits in the worst case. □ 



4.5 Lazy counters 

A lazy counter is a structure for generating a sequence of bit strings. In the first n 
bits, it counts through the standard binary representations of to 2" — 1. However, 
this can require updating up to n bits, so an additional data structure is added to 
slow down these updates, making it so that each successive state requires fewer bit 
changes to be reached. We present a few known lazy counters, and then improve 
upon them, using our results to generate quasi-Gray codes. 

4.5.1 Lazylncrement 

Frandsen et al. |SFMS97] describe a lazy counter of dimension d that reads and writes 
at most 0{\ogd) bits for an increment operation. The algorithm uses d = n + logn 
bits, where the first n are referred to as 6, and the last logn are referred to as i. A 
state in this counter is the concatenation of h and i, thus each state is a bit string of 
dimension d. In the initial state, all bits in h and i are set to 0. The counter then 
moves through 2""*"^ — 2 states before cycling back to the initial state, generating a 
cychc code. 

The bits of h move through the standard binary numbers. However, moving from 
one such number to the next may require writing as many as n bits. The value in i 
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is a pointer into b. For a standard binary encoding, the algorithm to move from one 
number to the next is as follows: starting at the right-most (least significant) bit, for 
each 1 bit, flip it to a and move left. When a bit is found, flip it to a 1 and stop. 
Thus the number of bit flips required to reach the next standard binary number is 
equal to one plus the position of the right-most 0. This counter simply uses i as a 
pointer into b such that it can flip a single 1 to a each increment step until i points 
to a 0, at which point it flips the to a 1, resets i to 0, and b has then reached the 
next standard binary number. The algorithm is stated as follows: 

Algorithm 6: Lazylncrement pFMS97j 
Input: 6[]: an array of n bits; i: an integer of logn bits 

1 if b[i] = 1 then 



^ 0; 



4 else 



^ 1; 
i ^ 0; 



7 end 



Lemma 4.12. [SFMSQl ^ Let n > 2 be a power of two. There exists a DAT of 
dimension d = n + logn, using the Lazylncrement algorithm, that generates 2""*"^ —2 
of a possible n2" bit strings, where in the limit n oo the space efficiency drops to 0. 
The DAT reads and writes in the worst case logn+ 1 bits to generate each successive 
bit string, and on average reads and writes 3 bits. 

Proof. The maximum number of bit strings generated by this counter is 2'^ = n2"', 
however it actually generates significantly less. 

The bits of b move through each standard binary number, but with additional 
states in between, such that each state in differs by a single bit in b. Thus, the 
number of states between two of these numbers is equal to the number of the bits 
that need to change, and this is equal to the distance to the right-most bit in b. 

The right-most bit in 6 is for half of the standard binary numbers. For the other 
half, there is a to the left of it half of the time. If the right-most bit is position 
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1, then bit j contains the right-most exactly 2"/2-' times, over all standard binary 

numbers which fit in n bits. Meanwhile, the bits in i make it possible to have these 

transitional states, but don't provide any additional states beyond them. Thus we 

can count the number of total states in the counter. Bit j is the right-most in 6 for 

2"/2-' numbers, and each time it is, it requires j bit flips to reach the next number. 

Additionally, when there are no bits in b at all, n bit flips are required to reach 

the next number, that returns b back to all zeros. Let G{d) be the number of bit 

strings generated by Lazy Increment, for dimension d = n + logn. Then the number 

" ^ 2" " ^ j 

of generated bit strings is G{d) = ^j — + n = 2"^ — + ra = 2"+^ - 2. 

i=i i=i 
The space efficiency of this counter, or the ratio of bit strings generated to the 

. G(d) 2^^+! - 2 2" - 1 ^. 2^^ - 1 ^ , 
number of possible strmgs is — 3— = = -. Since lim = 0, the 

counter is non-space-optimal, and the space efficiency of the counter grows worse as 

its dimension grows larger. 

To generate each successive bit string, the worst-case number of bits read or 
written by this counter is the cost to increment or reset i, which is an integer of 
dimension logra, plus the single bit which changes in b. Thus the counter never reads 
nor writes more than logn + 1 bits. On average, incrementing an integer (using the 
standard binary representation) requires reading and writing at most 2 bits, while 
only a single bit is ever read or written in 6 at a time. Thus the average cost to 
generate a bit string in this counter is at most 3. 

The Lazylncrement algorithm presented above can be constructed as DAT. First 
construct a DAT that reads the value of i. This tree reads all logn bits of i and has 
n leaves, one for each value of i. The leaf that represents i = x will then read bit x 
of b. If the bit was a 0, it is changed to a 1, and all of the bits in i are set to 0. If the 
bit was a 1, then the bit is changed to a and i is changed to represent x + 1. □ 

4.5.2 Spinlncrement 

An observation by Brodal |Bro09j (unpublished) leads to a dramatic improvement 
in space efficiency over the previous algorithm by adding a single bit to the counter. 
This extra bit allows for the logn bits in i to spin through all their possible values. 
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thus making better use of the bits and generating more bit strings with them. The 



variables b and i are unchanged from the counter in Lemma 4.12 , and is a single bit, 
making the counter have dimension d = n + logn + 1. The algorithm is as follows. 
Algorithm 7: Spinlncrement |Bro09] 
Input: an array of n bits; i: an integer of logn bits; k: a single bit 

1 if /c = then 

2 i i— i + 1 ; / / spin i 

3 if i = then 

4 k 1 ; // the value of i has rolled over 

5 end 



6 else 



7 
8 
9 
10 



L azy Increment (6 [], i) ', // really increment the counter 
if i = then 

A; ^ 0; 
end 



11 end 



Lemma 4.13. lBro09^ Let n > 2 be a power of two. There exists a DAT of dimension 
d = n + logn, using the Spinlncrement algorithm, that generates {n + 1)(2" — 1) of 
a possible 2n2" bit strings, where in the limit n — )■ oo the space efficiency converges 
to 1/2. The DAT reads and writes in the worst case logn + 2 bits to generate each 
successive bit string, and on average reads at most 4 bits. 

Proof. This counter spins through i every time that b becomes equal to its next value 

in the standard binary representation, for which there are 2" such cases. Spinning 

through all values for i adds exactly 2^°^" = n states. Thus the total number of states 

for this improved lazy counter is ra2" + 2"+^ - 2 - 2" = n2" + 2" - 2 = (n + 1)2" - 2. 

(n + 1)2" — 2 

The space efficiency of this improved counter is . In the limit n — )■ oo 

the space efficiency converges to 1/2. That is, when the counter has large dimension 
d, approximately half of the 2^^ possible bit strings are generated. 

The worst-case number of bits read or written by this counter is one more than the 



counter in Lemma 4.12 The only cases added are where i and k are changed, which 
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together do not exceed that bound. However, in the case where b and i are changed, 
k may also be read and changed, so the counter may read and write at most logri + 2 
bits to generate the next bit string. Checking the value of k requires reading a single 
bit, incrementing i requires on average to read and write at most 2 bits. Comparing 
i for equality to the fixed bit string 000. ..0 can be done for free after incrementing i 
in the DAT model. The Lazylncrement algorithm reads at most 3 bits on average, 
after k has been read, and is run an average of 2 times with n increments in between, 
contributing at most 4 to the average. From these observations, it follows that the 
average number of bits read and written by this counter, to generate the next bit 
string, does not exceed 4. 

The Spinlncrement algorithm can be constructed as DAT using the DAT from 



Lemma [4.12[ Add a new root node that reads k. When = 0, go to its left child. The 
right child is a subtree that reads all the bits of i. For all leaf nodes of this subtree, 
i is incremented. And when i was equal to n — 1, A; is also set to 0. When k = 1, 
go to its right child, which is the root of a subtree mostly identical to the DAT for 
Lazylncrement, with an extra rule in the leaf where i is set to 000. ..0, that sets k to 
as well. □ 



4.5.3 DoubleSpinlncrement 

By generalizing the dimension of fc, we are able to make the counter even more space 
efficient while keeping its O(logn) = 0{\ogd) worst-case bound for bits written and 
read. Let k he a bit array of dimension g > 1. Then for a counter of dimension 
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d = n + logn + g, the new algorithm is as follows. 
Algorithm 8: DoubleSpinlncrement 
Input: an array of n bits; i: an integer of logn bits; k: an integer of g bits 

1 if < 2^ - 1 then 

2 i ^ i + 1 ; / / happens in (2^ — l)/2^ of the cases 

3 if i = then 

4 k ^ k + 1; 

5 end 



6 else 



7 
8 
9 
10 



L azy Increment (6 [], i) ', // do a real increment 
if i = then 

A; ^ 0; 
end 



11 end 



Theorem 4.14. Let n > 2 be a power of two, and g > 1 be an integer. There exists a 
DAT of dimension d = n + logn + g with space efficiency 1 — 0(2^^). The DAT, using 
the DoubleSpinlncrement algorithm, reads and writes in the worst case g + logn + l 
bits to generate each successive bit string, and on average reads and writes 0(1) bits. 

Proof. This counter generates 2^ — 1 states for each time it spins through the possible 
values of i. Thus the number of bit strings generated is n2"(2^ — 1) + 2" — 2 = 
n2"'2^ — (n — 1)2" — 2. Given the dimension of the counter, the possible number of 
bit strings generated is n2"'2^. This gives a space efficiency of 1 — 0{2~^). When 



g = 1, we have exactly the same counter as given by Lemma 4.13 and when g is an 
increasing function of n, we produce a counter with space efficiency arbitrarily close 
to one. 

In the worst case, this counter reads and writes every bit in i and k, and a single 
bit in b, thus g + logn + 1 bits. On average, the counter now reads and writes 0(1) 



bits. This follows from a similar argument to that made for Lemma 4.13, where each 
line modified in the algorithm still reads on average 0(1) bits. 

The DoubleSpinlncrement can also be constructed as DAT by building on the 
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DAT from Lemma 4.13 Replace the root node with a subtree that reads all the bits 
of k and has 2^ leaf nodes. For leaf nodes that read a value in k less than 2^ — 1, the 
leaf node becomes the root of a subtree similar to the right child of the root node in 
the DAT for Spinlncrement. These subtrees are modified in that k is incremented 
instead of set to when i was equal to n — 1. The one leaf node where k = 2^ — 1 
becomes the root of a subtree identical to the left child of the root node in the DAT 
for Spinlncrement. □ 

Corollary 4.15. Let n > 2 be a power of two, and g = tlogn be an integer, for 
t > 0. There exists a DAT of dimension d = n + {t + 1) logn with space efficiency 
1 — 0{n~^). The DAT, using the DoubleSpinlncrement algorithm, reads and writes 
in the worst case {t + 1) log n + 1 bits to generate each successive bit string, and on 
average reads and writes 0(1) bits. 



Proof. This follows from Theorem 4.14 by substituting g = t log n. □ 



4.5.4 Winelncrement 

Rahman and Munro |RM08j present a counter that reads at most log n + 4 bits and 
writes at most 4 bits to perform an increment or decrement operation. The counter 
uses n + 1 bits to count through 2" states, and has space efficiency 2"/2'^"^^ = 1/2. 
Compared to DoubleSpinlncrement, their counter writes fewer bits per generating 
step, but is less space efficient. By modifying our lazy counter to use Gray codes 
internally, the worst-case number of bits read remains asymptotically equivalent to 
the counter by Rahman and Munro, and the average number of bits we read increases. 
We are able to write a smaller constant number of bits per increment and retain a 
space efficiency arbitrarily close to 1. 



We modify our counter in Theorem 4.14 to make i and k hold a cyclic Gray code 



instead of a standard binary number. The BRGC is a suitable Gray code for this 
purpose, so we will use it. Given a function next(j) that takes a bit string j of 
rank r in the BRGC and returns the bit string of rank r + 1, and a function rank(j) 
that returns the rank value of the bit string j in the BRGC. The following algorithm 
provides a lazy counter of dimension d = n + logn + g, where g > 1, that writes at 
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most 3 bits, and reads at most g + log n + 1 bits to generate the next state, and is 
space-optimal in the limit n — > oo. 

Algorithm 9: Winelncrementj^ 
Input: an array of n bits; i: a Gray code of logn bits; k: a Gray code of g 
bits 

1 if A; ^ 100. ..00 then 



i ^ next(i) ; // happens in (2^ — l)/2^ of the cases 
if i = then 

k ^ next (A;); 
end 



6 else 



7 
8 
9 
10 
11 
12 
13 
14 
15 
16 



// do a real increment 
if 6[rank(i)] = 1 then 
6[rank(i)] ^ 0; 
i ^ next(i); 
if z = then 

k ^ ] // wraps around to the initial state 
end 
else 

6[rank(?)] ^ 1; 
k ^ ; // resets to 
end 



17 end 



Theorem 4.16. Let n > 2 be a power of two, and g > 1 be an integer. There exists 
a DAT of dimension d = n + logn + g with space efficiency 1 — 0{2~^). The DAT, 
using the Winelncrement algorithm, reads in the worst case g + logn + 1 bits and 
writes in the worst case 3 bits to generate each successive bit string. 



^The name Winelncrement comes from the Caribbean dance known as Wincing, a dance that is 
centered on rotating your hips with the music. The dance is pervasive in Caribbean culture, and 
has been popularized elsewhere through songs and music videos such as Alison Hinds' "Roll It Gal" , 
Destra Garcia's "I Dare You" , and Fay- Ann Lyons' "Start Wincing" . 
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Proof. The number of states used by this counter would be the same as the 
DoubleSpinlncrement algorithm, except that we are unable to reset i to when 
a bit in b is flipped to 1 without changing up to logn bits in i. Instead, we leave i 
as it is and observe that it does not significantly reduce the space efficiency of the 
counter. 

Note that when a bit in b is flipped to a 1: 

• rank{i) points to the index of the right- most 1 in b, counting from the least- 
significant bit. 

• a bit in b flips to 1 if and only if the bit sequence b is entering the next state 
of the standard binary number encoding. That is, b counts from thru 2^ — 1, 
and back to 0, as a standard binary number, with extra states in between. The 
extra states all come from steps where a bit in b flips to 0. 

Based on these observations we can sum up all the values of i that occur when 

a bit in b flips to 1 as the positions of the right-most 1 in all possible bit strings of 

n 

dimension n, which is 2" — = 2""*"^ — n — 2. When i has a value greater than 0, 

the number of states lost compared to DoubleSpinlncrement is exactly the difference 

between i and 0. Thus, over all increment steps, this summation describes the total 

number of states lost compared to our third lazy counter. Therefore the total number 

of states used is n2"2» - {n - 1)2'^ - 2 - 2"+^ + n + 2 = n2"2» - {n + 1)2" + n. 

The number of bits, and thus, the number of possible states, is unchanged from the 

DoubleSpinlncrement algorithm. There are 2"+'°^"+^ possible states, and the space 

efficiency of this counter as n grows large converges to 

n2"25 - (n + 1)2" + n ^ (n + 1)2" - n , „^ 

lim ^ ^ = 1 - lim ^ ^ = 1 - 0(2"^) . 

n-*-oo n2"2f n^oo n2"2s 

The average number of bits read for each line of the algorithm is 0(1), with the 
exception of the lines which increment i and k. When they are executed, these lines 
read on average log n and g bits respectively, so the total average number of bits read 
is at most logn + g + 1. 

The counter writes at most one bit in each of b, i, and k, and thus writes in the 
worst case 3 bits per increment operation. □ 
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Corollary 4.17. Let n > 2 be a power of two, and t\ogn be an integer, for t > 0. 
There exists a DAT of dimension d = n + {t+l) log n with space efficiency 1 — {n~^) . 
The DAT, using the Winelncrement algorithm, reads in the worst case (t + 1) logra + l 
bits and writes in the worst case 3 bits to generate each successive bit string. 



Proof. This follows from Theorem 4.16 by substituting g = t log n. □ 



From Corollary 4.17 , we get a counter that is more space efficient than previously 
known counters, with a constant number of bits written for each increment. But while 
the counter reads at most {t + 1) \ogn + 1 bits in the worst case, its average number 



of bits read is also 0(log?T,). Using the quasi-Gray code counter from Theorem 4.10 
we are able to bring the average number of bits read down as well. The worst case 
number of bits read remains {t + 1) logn + 1, but on average, we only read at most 
121og^^'') n + 0(1) bits, for any c > 1. 

The algorithm does not need to change from its fourth iteration for these modifi- 



cations. We simply make i a quasi-Gray code from Theorem 4.10 of dimension log n 
and k a similar quasi- Gray code of dimension g. 

Theorem 4.18. Let n be a power of two such that log'-^'^^ n>ll and g be an integer 
such that \og^'^^~^'* g > 11. Then for any c > 1, there exists a DAT of dimension 
d = n + logn + g bits, using the Winelncrement algorithm, with space efficiency 
1 — 0{2~^), that reads in the worst case g + logn + 1 bits, writes in the worst case 
2c + 1 bits, and reads on average no more than 121og*-^'^^n + 0(1) bits. 

Proof. We determine the average number of bits read in the previous algorithm for 
each line of the Winelncrement function. Line [T] becomes true each time b reaches 
a standard binary number representation, and there are 2" such numbers. Each time 
the line becomes true, it stays so while i and k spin through their values together. 
During this time, the value of k changes every n steps, and we see all possible values of 
k except its maximally ranked state 1000. ..0. In the DoubleSpinlncrement algorithm, 
this would account for n2"(2^ — 1) states, but all states lost due to i happen during this 
stage, and so it accounts instead for n2"(2^-l)-2"+^+n+2 = r22"2^-(r2+2)2"+n+2 
states. 
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If we saw each value of k once, the number of bits read to test if the string is 

g 

equal to 1000. . .0 would be ^ i ■ 2^ /2' = 2^+^ -g-2. Instead, we see each value in k 

i=l 

for n consecutive steps, except for state 1000.. .0. Therefore, the number of bits read 
between states where k = 1000. ..0 is n{2^+^ -g-2- g2^ /2S) = n2^+^ - 2ng - 2n. 

During the remaining 2" — 2 states, k = 1000. ..0, and line [T] will read all g bits 
each time it is executed, reading a total of g2^ — 2g bits. Thus the the average number 
of bits read by line [T] is 

^2f+i - 2ng -2n + ^2" - 2g 



< 



n2"29 -{n + 1)2" + n 

n29+^ + ^2" 

^2"+s - {n + 1)2" + n 

1 n2"+s g n2'^+3 

+ 



2n-i n2^+g - (n + 1)2" + n n29 n2"+f - (n + 1)2" + n 

1 n2"+3 - (n + 1)2" + n + (n + 1)2" - n 
2"-i ra2"+9 - (n + 1)2" + n 

g ra2"+9 - (ra + 1)2" + n + {n + 1)2" - n 



+ 



< 



n29 n2"+f -{n + 1)2" + n 

1 / (n+l)2" \ g ( (n + l)2" 



2"-i V n2"+9 - (n + 1)2" + n / n29 \ n2"+s - (n + 1)2" + n 



2(n + l) 2 
< 2 + — \ ^ + 



n2"+9 - {n + 1)2" + n n2"-+9 - {n + 1)2" + n 
= 0(1), 

since each of the two fractions are at most one when n > 2. 

Line [2] increments our counter from Theorem 4.10 and reads on average < 
eiog^^'^-^^Oogn) + 11 = eiog^^'^n + 11 bits. 

Line [3] checks if i is equal to a specific value. This check is done consecutively 
over all values of i. Thus half the time, only one bit needs to be checked, a quarter 
of the time, two bits, and so on. The average number of^bits read for this line is then 

log n 

returns false, it does so once 



Ei — , each time it is reached. Once the check in line 
9*' 



2 

1=1 



for each value of k other than 10. .00. Thus this line is executed (2^ — 1)(2" — 1) times. 
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The average number of bits read by hne [3] is 



< 



^ 2»'n(2f-l)(2"-l) + 2"-2"+i + n + l 
2n2f f2" - 1) 



n(2f - 1)(2" - 1) - 2" + n + 1 

^29+1(2" - 1) 
n{29 - 1)(2" - 1) - 2" 

29+1 



(2f - 1 

< 



2" 



n(2"-l) 

29+1 



(29 - 1) - 1 

o (2^-2) + 2 
(29 - 2) 

2+ ^ 



(29 - 1) 
= 0(1). 

Line |4] is similar to line [2| but with a slightly larger counter. It reads on average 
%< 61og(^'"^)(tlogn) + 11 = 61og(^'"^)t + 61og^^^)n + 11 bits, for c > 1. 

Line [7] reads a single bit in b, and must read all logn bits in i in order to determine 
its rank. The line of code is executed 2"^^ — 2 times over the entire sequence of 
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transitions for the counter as a whole. Thus the average number of bits read is 

(1 + logn)(2"+i - 2) 



< 



n{29 - 1)(2" - 1) + 2" - 2"+i +n + l 
2(2" - 1) + 2(2" - l)logn 

n{29 - 1)(2'^ - 1) - 2" + + 1 

2(2" - 1) + 2(2" - l)logn 
n(2f - 1)(2" - 1) - 2" 
2 + 21oe:n 



< 



< 2 

< 2 



n(29 - 1) - 1 - 

2 + 2 log n 
n{29 - 1) - 2 

21ogn -2n(2» - 1) 



n(2f - 1) 
2 logn 



n(29 - 1) 
= 0(1). 

Line |8] reads all logn bits of i, similarly to line [7} and is executed no more times 
than is linejTj Thus the average number of bits read for linejsjis -%]< 0(1). 

Line [9] has the average cost as line |2] per execution, and is executed fewer times 
over all generated states, thus < bits. 

Line 10 has an average cost per execution that is the same as line|3| but is executed 



fewer times, at most 2""*"^ — 2 times. Thus its average cost ifa < 0(1). 



Line 11 does not need to read any bits, as the state of k is already known from 
line[il 



Line 14 is similar to line [8} and does not read more bits on average than line [7j 



Its average cost is ifa ^Tj 



Line 15 , like line [TT| does not need to read any bits. 

El 



The total average number of bits read can be determined by the summation 

i=l 

It is clear that this average is at most 12 log^^'^^ n + 0(1). The number of bits written 
in the worst case remains constant, though it grows slightly to 2c + 1. Thus with 
c = 1, we can read on average O(loglogn) bits, write at most 3, and read in the 
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worst case g + log n + 1 bits. This is accomplished while keeping the space efficiency 
arbitrarily close to 1, as in the previous counter. 

The Wine Increment algorithm can be constructed as a DAT as well, as long as the 
quasi-Gray codes used in i and k can be generated with a DAT. This is done by using 



the DAT structures for i and k inside the tree from Theorem 4.14 For this DAT, we 
are using a quasi-Gray code instead of standard integers for our counting variables. 
While this changes the description of the resulting DAT, the actual structure does 
not change dramatically. First, replace the root node with a subtree Tk that reads all 
the bits of k and has leaf nodes, labeled from 1 to n^. Let leaf j be reached when 
k has rank j in its quasi-Gray code. The highest rank leaf will read the last bit string 
from the code in k, and it becomes the root of another subtree, Tj^^. This subtree 
corresponds to the main else clause of the algorithm. The subtree reads i, and has 
n leaves, one for each possible state of i. Again, let leaf j be reached when i has rank 
j in its quasi-Gray code. Then, in the leaf node j of Tj^^, the j-th bit of b is read. 
If the bit was a 0, it is changed to a 1, and k is moved ahead one state to 0. If the 
bit was a 1, then the bit is changed to a and i is changed to represent the state of 
rank j -|- 1 in its quasi-Gray code, and lastly if j is n — 1, that is this leaf represents 
the highest rank for the quasi-Gray code in i, additionally move k ahead one state to 
return it to 0. 

Other leaf nodes of Tk also become roots of subtrees Tj^ for 1 < u < n^, where Tj^ 
is rooted at the u-th child of T^, and is reached when k has rank u in its quasi-Gray 
code. These subtrees represent the many cases where the main if clause is true in 
the algorithm. The subtrees each read all of i and have n leaf nodes, one for each 
possible state of i. Each leaf node has a rule to modify i such that its rank in its 
quasi-Gray code increases one. Additionally the n-th leaf of each subtree T,^, has a 
rule to modify k to represent the state of rank -u -|- 1 in its quasi-Gray code. □ 

Corollary 4.19. Let n be a power of two such that log^'^^^ n > 11 and tlogn be an 
integer, for t > such that \og^'^^~^\t\ogn) > 11. Then for any c> 1, there exists a 
DAT of dimension d = n-|-(t-|-l) logn bits, using the Winelncrement algorithm, with 
space efficiency 1 — 0(?2^*), that reads in the worst case {t + 1) logn + 1 bits, writes 
in the worst case 2c + 1 bits, and reads on average no more than 121og*-^^''n -|- 0(1) 
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bits. 

Proof. This follows from Theorem 4.18 by substituting g = t log n. □ 



Corollary 4.20. Let n > 2^ be a power of two, and tlogn be an integer, for 
t > such that log*-^'^^"'^^(t logn) > 11. Then there exists a DAT of dimension d = 
n + {t + l)logn bits, with space efficiency 1 — 0{n~^), that reads in the worst case 
{t + 1) logn + 1 bits, writes in the worst case log*n — 3 bits, and reads on average 
0(1) bits. 

Proof. Let c = [(log* n — 4)/2j. By our choice of c and n, we know c > 1 and 
log*-^'^^ 72 > log^^°^ n > 11. Then the result follows directly from Corollary 



4.19 

□ 



Chapter 5 
Conclusion 



5.1 Summary 

We have shown in this thesis how to generate a Gray code, while reading significantly 
fewer bits on average than previously known algorithms, and how to efficiently gen- 
erate a quasi-Gray code with the same worst-case performance and improved space 
efficiency. We give a tradeoff between space-efficiency, and the worst-case number 
of bits written. When the worst case number of bits is equal to the dimension of 
the code, d, it is easy to generate all 2'^ bit strings. Our Recursive Partition Gray 
Code gives a rf- dimensional Gray code, writing at most a single bit to generate each 
successive bit string, while reading no more than 6 log d bits on average and with an 
optimal space efficiency of 1. As with all known algorithms that generate a Gray 
code, this requires all d bits are read in the worst case. 

But when the worst-case number of bits read decreases, our space-efficiency does 
also. As an improvement on previous results, we present algorithms that are able to 
retain space efficiency arbitrarily close to 1 by reading more than a constant number 
of bits on average. Our Winelncrement algorithm gives an algorithm to generate a 
quasi-Gray code with dimension d — n+{t + l) \ogn. This algorithm reads on average 
12 log^^"^^ n + 0{l) bits, while in the worst case it reads only {t + 1) log n + 1 bits. This 
is done with a space efficiency of 1 — The quasi-Gray code changes in at most 

2c +1 bits to reach any successive state in the code. 
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This trade-off highlights the initial problem which motivated this work: a lower 
bound on the number of bits read in the worst case, for a Gray code with space 
efficiency of 1. Fredman |Fre78j showed a lower bound of Q{logd) bits read in the 
worst case, and improving this bound remains an open problem. 

During our efforts to find a better lower bound, we used extensively the idea of 
a Gray code being equivalent to a Hamiltonian path (or cycle, for a cyclic code) on 
a hypercube |Lei92] . We aimed to find an example of a Gray code of dimension d, 
which read at most d — 1 bits in the worst case, or alternatively to show that such a 
code is not possible. When traversing a hypercube, a rule is a directed edge between 
two points on the hypercube, x and y, with the point x corresponding to a path in 
the DAT. If every path in the DAT has length d — 1, then it would be used in exactly 
two situations: when the bit it did not read is either a or a 1. In the hypercube, this 
means that any rule we add, from x to y, must be paired with a second parallel rule 
starting from a position one hop away from x and ending one hop away from y. Thus 
all rules must come in pairs, travelling parallel to each other, in the same direction. 



and adjacent to a shared face on the hypercube. Figure ?? shows an example of a 
hypercube with dimension three, and paired rules along its edges. Note that only one 
rule may exist on any one edge of the hypercube. 

This model is weaker than the DAT model, as not all arrangements of rules on 
the hypercube may be constructed as a DAT, however all DATs of height d—1 can 
be converted to pairs of rules on the hypercube. Thus, showing such a construction is 
not possible on the hypercube would also show it is not possible for a DAT. If paired 



rules, such as those in Figure ??, cannot be placed without overlapping such that a 
Hamiltonian (cycle) path is formed on the hypercube, then it is also not possible to 
construct a DAT that generates a (cyclic) Gray code of dimension d without reading 
d bits in the worst case. For small d, it quickly becomes obvious that such pairs of 
rules cannot be used to construct a Hamiltonian cycle on the hypercube. For larger 
dimensions, we conjecture that it is never possible to construct a Hamiltonian cycle 
using pairs of parallel face-adjacent rules on the hypercube. 
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Figure 5.1: Rules for traversing a hypercube of dimension d — 3 that require reading 
d—1 bits. The rules are directed dashed hues, each of which shows the bit that would 
be changed when the code's current bit string is equal to the label of the rule's source 
vertex in the hypercube. A dark line connects two states that differ only in the bit 
not read for the two paired rules adjacent to it. 
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5.2 Future work 

The main open problem remains: finding a better lower bound for the worst-case 
number of bits read by a Gray code with space efficiency of 1, or finding an example 
of a such Gray code that always reads less than d bits to generate the next bit string 
in the code. 

When considering the average number of bits read, is it possible to find a Gray 
code with space efficiency 1 that requires reading less bits than the Recursive Partition 
Gray Code? As a Gray code, what properties does the RPGC have, such as run length, 
balance of bit flips, and others discussed by Savage? jSav97j 

What other properties does the Recursive Partition Gray Code have, such as 

Is it possible to flnd a quasi-Gray code with space efficiency 1, or a quasi-Gray 
code that reads fewer bits on average or in the worst case than the Wine Increment 
algorithm? Is there a lower bound on the space efficiency in relation to the number 
of bits read on average? Fredman |Fre78j showed a tradeoff between the number of 
bits written and read in the worst case. Can a tighter tradeoff be shown, between the 
worst-case number of bits written, the worst-case number of bits read, the average 
number of bits read, and/or the space efficiency of a quasi-Gray code? 

Our Recursive Partition Gray Code does provide a counter-example to any efforts 
to show a best-case bound of more than Q{logd), and our hope is that this work will 
contribute to a better understanding of the problem, and eventually, a tighter lower 
bound in the case of generating a space-optimal Gray code. 
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